perm filename FONT.PUB[PRO,MUS] blob sn#090347 filedate 1974-03-04 generic text, type C, neo UTF8
COMMENT ⊗   VALID 00008 PAGES
C REC  PAGE   DESCRIPTION
C00001 00001
C00002 00002	.DEVICE XGP
C00003 00003	.SELECT B
C00008 00004	.BEGIN FILL ADJUST
C00013 00005	.SELECT A
C00018 00006	.SELECT 5
C00022 00007	.SELECT 5
C00026 00008	.GROUP SKIP 2
C00030 ENDMK
C⊗;
.DEVICE XGP
.FONT 3 "FIX25"
.FONT 2 "SUP"
.FONT 1 "NGR30"
.FONT 4 "GRK30[MA,JAM]"
.FONT 5 "BASI30"
.FONT 6 "GRKL40[MA,JAM]"
.FONT 7 "SUBSUB"
.FONT 8 "SUB"
.FONT 9 "FIX30"
.FONT A "BASB30"
.FONT B "CLAR30"
.FONT C "BDJ25"
.!XGPCOMMANDS←"/XLINE=10";
.TURN ON "%↑↓";
.EVERY HEADING(,{PAGE},)
.PAGE FRAME 39 HIGH 80 WIDE
.TITLE AREA HEADING LINES 1 TO 2
.AREA TEXT LINES 3 TO 38
.TITLE AREA FOOTING LINE 39
.SELECT 1
.NOJUST
.NOFILL
.NEXT PAGE

.SELECT B
        SIMULATION OF MUSIC INSTRUMENT TONES
.SELECT 1

.BEGIN FILL ADJUST

In this part  of the proposal we  will discuss our approaches  to the
computer simulation of  music instrument tones.  The main goal of our
research is the development of a  powerful, general purpose technique
for the simulation of auditory  signals that will have the perceptual
complexity  and naturalness of the musical  sounds which occur in the
real world.  The fundamental concern  here is  with the synthesis  of
natural  timbres of  the extremely  varied and  highly  complex tones
which occur in music.   Timbre can  be psychoacoustically defined  as
that multidimensional attribute of sound by which  a listener is able
to  differentiate  two auditory  signals  which are  equal  in pitch,
loudness, and duration.  Although a general model does  not presently
exist  for the  perception of  timbre, the  psychoacoustical research
which  has  been done  in  the last  century implicates  a  number of
physical properties  of tone  which influence  timbre, including  the
spectral distribution of energy  and the temporal pattern of spectral
evolution.

Supporting  this  goal  to  develop  a  technique  for  the  computer
simulation  of  natural  tones is  a  concurrent  research effort  to
formulate a  perceptual model  which is  able to  desribe  the  human
processing of musical sounds. The theoretical problem is to establish
a  set of acoustical dimensions, those  which are actually salient in
the perception  of musical  timbre,   and to  design a  computational
algorithm that enables the user to exert the highest level of control
with respect to these dimensions for the purposes of simulation.  The
empirical problem which follows is the determination of those aspects
of  the signal  which are  actually important  in the  perception and
identification of a  sound. Necessarily  included is a  study of  the
distinctive  features  of signals,    the  investigation of  physical
conditions  which contribute to the naturalness  of a signal. Related
research  should  examine  the  general  characteristics   of  timbre
perception,  looking  into  the  effects  of such  phenomena  as  the
categorical identification of musical sounds.

The discussion which  follows is concerned  with a desription  of our
systematic approach to a general model for simulation, which is based
on  perceptual  validation  at  every  step.  Two  initial strategies  
for  simulation,  the  analysis-based  additive  synthesis  and   the
frequency-modulation  synthesis, are seen to  converge on this model.
The first method is concerned with the procedures  of data reduction,
in that we start with  the most complete, complex information about a
real signal through its analysis.  We then systematically step in the
direction of the most  simple representation of the signal  which can
be  used successfully to  synthesize a perceptual  replication of the
original tone.   To this  end we examine  the perceptually  important
aspects of physical signals. 
.END
.BEGIN FILL ADJUST
The second  approach towards  a model  for simulation  begins with  a
simpler,    more  easily-controlled  process,    frequency-modulation
synthesis of sound,   which  allows the user  to directly  manipulate
aspects of the signal that we have found to be very powerful in terms
of certain perceptual cues for music instrument tones. The success of
this method first  came as a surprise  to many, in that  the physical
waveform that  it generates is strikingly different  from any natural
signals.  However,  upon inspection, the reasons for this success may
be determined,  and we may thereby learn what physical dimensions are
important  in  the perceptual  achievement.   The  direction  of this
approach is to increase the  complexity of the process,  until  there
is control  over a very wide  set of features which  occur in natural
tones.

The approaches described are  not found to proceed independent of one
another. Findings in one technique can immediately be applied to  the
other, and a system of cross-verification is thereby established. The
ultimate  model  for simulation  will  draw from  findings  from both
approaches. A common  aspect of  both approaches is  the concern  for
perceptual   verification  of   the  particular   results  at   hand.
Experimental  methods of  perceptual psychology are  employed for the
rigorous verification of the success of simulation,  in  terms of the
discriminability of a simulation from  real tones and in terms of the
naturalness of simulation. 

The ultimate  aim  of our  research is  the  development of powerful,
general purpose  algorithms  for the  computer simulation  of natural
tones. Accordingly, we will investigate the general properties of the
perceptual processing  of such  signals.  The  general model  for the
perception  of different sets  of tones will  provide information for
the  construction   of  perceptually-based  higher-order   simulation
algorithms.   We employ a spatial model  for the subjective structure
of  the  perceptual  relationships  between  signals.    Research  is
directed at  uncovering the dimensionality  of the  subjective space,
the psychophysical relationships which are structurally correlated to
this space,  and the properties of the space.  The existence  of such
constraints  as categorical  boundaries  will be  investigated in  an
attempt to  assess the continuity of the subjective space for timbre.
In the  same regard,  we will  also  examine the  effects of  musical
training or context on the structure of the space.  The model will be
evaluated by our ability  to predict the mappings  of real and  novel
tones.
.END
.GROUP SKIP 3
.SELECT A
                    ANALYSIS-BASED ADDITIVE SYNTHESIS
.SELECT 1

.BEGIN FILL ADJUST
This section  will be devoted  to a discussion  of the  simulation of
musical  tones by way  of analysis-based additive  synthesis. We will
first  indicate  the  computational  methods  involved  in   additive
synthesis, and then go on to  describe the analysis of of real tones,
which  provides  the  initial information  for  simulation.   Graphic
techniques which aid the researcher in the examination of the complex
physical properties of tones will be described.

Next  we  will  discuss   the  perceptual  validation  of  the  basic
analysis-synthesis model.    We  will  then go  on  to  describe  the
direction of our current research, the reduction  of the very complex
physical data which is obtained from the analysis of real tones. Data
reduction will be found to depend on an empirical verification of the
perceptual fitness  of measures  taken: the success  or failure  of a
current  data reduction strategy  will contribute to  a general model
for the perceptual processing of musical tones;  the model,  in turn,
will direct  the next stage of data  reduction. We will also indicate
future research which  is planned  for the approach  to our  ultimate
end:  a powerful,    general,   and  easily-controlled algorithm  for
simulation  which is  based on  a comprehensive perceptual  model for
natural tones.
.END
.NEXT PAGE
.SELECT 3
SYNTHESIS AND ANALYSIS TECHNIQUES
.GROUP SKIP 1
.SELECT 5
synthesis
.SELECT 1
.BEGIN FILL ADJUST
In  additive  synthesis  we  physically  model  the  digitized  sound
waveform  as a sum  of sinusoids with  slowly time-varying amplitudes
and phases.  Equation (1) summarizes this formulation.  The sinusoids
in equation (1) represents the partial tones.
.END


          M
(1)  F%8α%1 = %6S%1 A%8j%1 sin(%4w%8j%1hα+%4q%8j%1)
         j=1

Notation:    F%8α%1 is the sampled, digitized waveform at time αh
	     h is the time between consecutive samples
	     A%8j%1 is the amplitude of the jth partial tone
	         and is assumed to be slowly varying with time
	     %4q%8j%1 is the phase of the jth partial tone
	         and is assumed to be slowly varying with time

.BEGIN FILL ADJUST
One can see that from this model, if we can determine the
parameters A%8j%1 and %4q%8j%1 of a tone from a musical instrument,
we can easily synthesize the waveform F%8α%1 from those parameters
by use of equation (1). The degree to which this form of synthesis
has been successful will be discussed below.
.END
.GROUP SKIP 2
.SELECT 5
analysis for additive synthesis
.SELECT 1
.BEGIN FILL ADJUST
We now turn to the problem of determining the parameters
A%8j%1 and %4q%8j%1 of a music   instrument tone. To aid the
analysis, we must assume the frequencies of the partial tones,
%4w%8j%1, are nearly harmonically related. That is, there is some
frequency, %4w%1, such that %4w%8j%1 is approximately j%4w%1. We shall
call this frequency %4w%1 the fundamental frequency of the tone.

The method we have found most useful we call the "hetrodyne
filter." This is described in detail in reference [**Moorer**] and is
derived briefly in appendix A. Basically, the method is as follows:
First, compute the following two summations at each point in time α.
.END

         α+N-1
(2)  %9a%8j%8α%1 = %6S%1 F%8i%1sin(j%4w%80%1ih+%4f%80%1)
          i=α

         α+N-1
(3)  %9b%8j%8α%1 = %6S%1 F%8i%1cos(j%4w%80%1ih+%4f%80%1)
          i=α

.NEXT PAGE
From these, we calculate two more sequences:

(4)     A%8jα%1 = (%9a%8jα%22%1+%9b%8jα%22%1)%21/2%1

(5)     %4f%8jα%1 = atan(%9a%8jα%1/%9b%8jα%1)

.BEGIN FILL ADJUST
The summations are taken to be over one period of a sinusoid
of frequency %4w%80%1, that is, Nh%4w%80%1 = 2π. This places
somewhat of a restriction on the frequency of analysis, %4w%80%1,
because in the discrete domain, the period, N, is restricted
to integral values. This has not proved a problem in our experience.

If the partial tones are nearly harmonically related, if the
parameters of the tone vary slowly with time, and if %4w%80%1 is nearly
equal to the fundamental frequency of the tone, then A%8jα%1 and
%4f%8jα%1 will indeed be approximations to the actual amplitudes and
phases of the partials of the tone under analysis.

Let us review the procedure. First, we do the computations indicated in
equations (2), (3), (4), and (5) for each of the partials of a
tone, over the entire time interval spanned by the tone. The output
A%8jα%1 and %4f%8jα%1 may then be used in equation (1) to synthesize a new
tone that hopefully retains much of the character of the original.
.END
.GROUP SKIP 2
.SELECT 5
graphic techniques
.SELECT 1
.BEGIN FILL ADJUST
As aids to the researcher, we have designed several different methods
for displaying  the results of analysis. The  output of the hetrodyne
filter can, of course, be displayed as a number of isolated amplitude
and phase plots,  covering the individual components, as  is shown if
figure 1  for the first sixteen harmonics of a violin tone. The total
duration of the tone  is about 400 milliseconds and  it's fundamental
frequency  is about 311  Hz. It  has been  found most  informative to
simultaneously view the entire set of harmonics together.  One method
is the use  of perspective plots. Figure 2  shows such a plot  of the
amplitudes of the  partials of the same violin tone.  The fundamental
harmonic is in the background of  the picture, the highest is in  the
foreground.  This allows  us to  more readily  discover relationships
among  the harmonics.   This  perspective plot  can be
spatially rotated on-line,  so  that the observer is able to  see the
three-dimensional representation from any  angle.  This has been very
helpful in getting a more comprehensive understanding of the behavior
of the partials of a tone as a function of time.

Another form of showing  the temporal evolution of the  partials of a
tone  is the  sequential line-spectrum plot.   This  strictly on-line
display presents two-dimensional frequency by amplitude plots  of the
partials  for a  given  instant in  time.   The  observer traces  the
changes   in  these  parameters  through   a  sequential  display  of
instantaneous values from the beginning to the end of the tone.  This
animation  technique has  been  found to  reveal  further information
gathered from analysis.

.NEXT PAGE
A fourth way of examining the output of the hetrodyne filter, inspired
by the  sonogram,  is given in figure 3.  Here, the thickness of each
bar is proportional to the log of the amplitude of that harmonic. The
vertical   position  represents   its  instantaneous   frequency,  as
determined from the phase drift of the harmonic.  The utility of this
display is it's representation of the phase  information with respect
to amplitudes.
.END
.GROUP SKIP 2
.SELECT 3
PERCEPTUAL VALIDATION OF ANALYSIS-SYNTHESIS STRATEGY
.SELECT 1
.BEGIN FILL ADJUST
It is  necessary  to confirm  the utility  of our  analysis-synthesis
strategy  for the  simulation of  musical tones on  the basis  of the
perceptual success of the resultant auditory simulation.  That  is to
say,   we must  establish that  the signal which  is produced  by our
analysis-synthesis technique is indiscriminable from the recording of
the original sound.  The crutial test,  then,  is a comparison of the
sound which has been produced by additive synthesis with the original
musical tone that  was analyzed with  the hetrodyne filter.  Informal
experimentation has shown that the analysis-synthesis method produces
an extremely  convincing replication of the  original signal, and the
perceptual  validity  of  the  method  has  been  thereby  informally
established.  This  is  in  agreement  with  the  findings  of  other
investigators   who  have   attempted  to  verify   similar  analysis
techniques by  comparing  tones synthesized  from analysis  with  the
original signals (Risset, 1966; Freedman, 1967, 1968; Beauchamp, 1969).

Among  the  types   of  natural  musical  signals  which   have  been
sucessfully  simulated by the analysis-synthesis  procedure are tones
from the string,   woodwind,   and brass  families of the  orchestra.
Specifically, we have been able to reproduce tones of various pitches
and durations from the following instruments:
.END
.GROUP SKIP 1
.NOFILL
.NOJUST
	violin,  viola,  cello,  string bass,  trumpet,  trombone,
        french horn,  baritone horn,  oboe,  english horn,  flute,
	Bb clarinet,  alto clarinet,  bass clarinet,   alto flute,
	alto sax,  soprano sax,  bassoon.
.GROUP SKIP 1
.BEGIN FILL ADJUST
The  comparisons  between  the  original  recorded  tones  and  their
respective  simulations  by  experienced  listeners,   composers  and
acousticians, have demonstrated the perceptual validity and potential
power of our analysis-synthesis technique.
.END
.GROUP SKIP 2